On Batch-Scheduling Dags for Internet-Based Computing

نویسندگان

  • Grzegorz Malewicz
  • Arnold L. Rosenberg
چکیده

The process of scheduling computations for Internet-based computing presents challenges not encountered with more traditional platforms for parallel and distributed computing. The looser coupling among participating computers makes it harder to utilize remote clients well and also raise the specter of a kind of “gridlock” that ensues when a computation stalls because no new tasks are eligible for execution. This paper studies the problem of scheduling computation-dags in a manner that renders tasks eligible for allocation to remote clients (hence for execution) at the maximum possible rate. Earlier work has developed a framework for studying this problem when a new task is allocated to a remote client as soon as it returns the results from an earlier-allocated task. The proof in that work that many dags cannot be scheduled optimally within this scheduling paradigm demonstrated the need for a companion scheduling theory that addresses the scheduling problem for all computation-dags. A new, batched, scheduling paradigm for InternetComputing is developed in this work. Although optimal batched schedules exist for every computation-dag, computing (successive steps of) such a schedule is shown to be NP-Hard, even for bipartite dags. In response, a polynomial-time algorithm is developed for producing optimal batched schedules for a rich family of dags that are constructed by “composing” tree-structured building-block dags. Finally, a fast heuristic schedule is developed for a class of “expansive” dags

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Scheduling Collaborative Computations on the Internet, I: Mesh-Dags and Their Close Relatives

Advancing technology has rendered the Internet a viable medium for collaborative computing, via mechanisms such as Web-Based Computing and Grid-Computing. We present a “pebble game” that abstracts the process of scheduling a computation-dag for computing over the Internet, including a novel formal criterion for comparing the qualities of competing schedules. Within this formal setting, we ident...

متن کامل

Assessing the Computational Benefits of AREA-Oriented DAG-Scheduling

Many modern computational platforms, including “aggressive” multicore architectures, proposed exascale architectures, and many modalities of Internet-based computing are “task hungry”—their performance is enhanced by always having as many tasks eligible for allocation to processors as possible. The IC-scheduling paradigm for computations with inter-task dependencies—modeled as DAGs—was develope...

متن کامل

Optimization Task Scheduling Algorithm in Cloud Computing

Since software systems play an important role in applications more than ever, the security has become one of the most important indicators of softwares.Cloud computing refers to services that run in a distributed network and are accessible through common internet protocols. Presenting a proper scheduling method can lead to efficiency of resources by decreasing response time and costs. This rese...

متن کامل

Improving the palbimm scheduling algorithm for fault tolerance in cloud computing

Cloud computing is the latest technology that involves distributed computation over the Internet. It meets the needs of users through sharing resources and using virtual technology. The workflow user applications refer to a set of tasks to be processed within the cloud environment. Scheduling algorithms have a lot to do with the efficiency of cloud computing environments through selection of su...

متن کامل

Advances in a Dag-Scheduling Theory for Internet-Based Computing

Earlier work has developed the underpinnings of a theory of scheduling computations having intertask dependencies—modeled via dags—for Internet-based computing. The goal of the schedules produced is to render tasks eligible for execution at the maximum possible rate. This goal aims: (a) to utilize remote clients’ computational resources well, by always having work to allocate to an available cl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006